Analyzing neural network behavior through deep statistical model checking

نویسندگان

چکیده

Abstract Neural networks (NN) are taking over ever more decisions thus far taken by humans, even though verifiable system-level guarantees out of reach. Neither is the verification technology available, nor it understood what a formal, meaningful, extensible, and scalable testbed might look like for such technology. The present paper an attempt to improve on both above aspects. We family formal models that contain basic features automated decision-making contexts which can be extended with further orthogonal features, ultimately encompassing scope autonomous driving. Due possibility model random noise in decision actuation, each instance induces Markov process (MDP) as object. NN this context has duty actuate (near-optimal) decisions. From perspective, externally learnt serves determinizer MDP, result being chain amenable statistical checking. combination MDP encoding action policy central we call “deep checking” (DSMC). While straightforward extension checking, enables gain deep insight into questions “how high NN-induced safety risk?”, good compared optimal policy?” (obtained checking MDP), or “does training NN?”. report implementation DSMC inside Modest Toolset NNs, demonstrating potential various instances family, illustrating its scalability function size well other factors degree training.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Better Railway Engineering Through Statistical Model Checking

Maintenance is essential to ensuring the dependability of a technical system. Periodic inspections, repairs, and renewals can prevent failures and extend a system’s lifespan. At the same time, maintenance incurs cost and planned downtime. It is therefore important to find a maintenance policy that balances cost and dependability. This paper presents a framework, fault maintenance trees (FMTs), ...

متن کامل

Analyzing Industrial Simulink Models by Statistical Model Checking

The evolution of automotive systems has been rapid. Nowadays, electronic brains control dozens of functions in vehicles, like braking, cruising, etc. Model-based design approaches, in environments such as MATLAB Simulink, seem to help in addressing the ever-increasing need to enhance quality, and manage complexity, by supporting functional design from predefined block libraries, which can be si...

متن کامل

TBD: Benchmarking and Analyzing Deep Neural Network Training

The recent popularity of deep neural networks (DNNs) has generated a lot of research interest in performing DNN-related computation efficiently. However, the primary focus is usually very narrow and limited to (i) inference – i.e. how to efficiently execute already trained models and (ii) image classification networks as the primary benchmark for evaluation. Our primary goal in this work is to ...

متن کامل

Checking & Distributing Statistical Model Checking

In this paper we propose a general framework for distributed statistical model checking of networks of priced timed automata. The first contribution is a new algorithm to distribute sequential hypothesis testing without introducing bias in the results. The second contribution is an implementation of this algorithm in Uppaal. The major contribution is an experimental and analytical evaluation of...

متن کامل

Statistical Model Checking in

Statistical model-checking is a recent technique used for both verification and performance analysis of hybrid systems. It does not suffer from decidability issues or state-space explosion compared to traditional model-checking. Furthermore, it is applicable to more powerful formalisms such as stochastic hybrid automata. Its principle is simple so how simple is it really to make it work in prac...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal on Software Tools for Technology Transfer

سال: 2022

ISSN: ['1433-2779', '1433-2787']

DOI: https://doi.org/10.1007/s10009-022-00685-9